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A  Distributed  and  Energy-Efficient  Framework  for 
Neyman-Pearson  Detection  of  Fluctuating  Signals 
in  Large-Scale  Sensor  Networks 

Yang  Yang,  Member,  IEEE,  Rick  S.  Blum,  Fellow,  IEEE,  and  Brian  M.  Sadler,  Fellow,  IEEE 


Abstract—  To  address  the  challenges  inherent  to  a  problem  of 
practical  interest  —  of  Neyman-Pearson  detection  of  fluctuating 
radar  signals  using  wireless  sensor  networks,  we  propose  in 
this  paper  a  distributed  and  energy-efficient  framework.  Such 
framework  is  scalable  with  respect  to  the  network  size,  and  is 
able  to  greatly  reduce  the  dependence  on  the  central  fusion 
center.  It  assumes  a  clustering  infrastructure,  and  addresses 
signal  processing  and  communications  related  issues  arising 
from  different  layers.  This  framework  includes  a  distributed 
scheduling  protocol  and  a  distributed  routing  protocol,  which 
enable  sensor  nodes  to  make  their  own  decisions  about  infor¬ 
mation  transmissions,  without  requiring  the  knowledge  of  the 
network  global  information.  In  this  framework,  energy  efficiency 
manifests  itself  at  different  network  layers  in  a  distributed 
fashion,  and  a  balance  between  the  detection  performance  and 
the  energy  efficiency  is  also  attained. 

Index  Terms— Distributed  routing,  distributed  scheduling, 
Kullback-Leibler  distance,  Neyman-Pearson  criterion,  signal  de¬ 
tection,  and  wireless  sensor  networks. 


I.  Introduction 

FOR  A  GOOD  many  envisioned  applications  of  wireless 
sensor  networks  (WSNs),  signal  detection  based  on  data 
collected  from  distributed  sensors  often  serves  as  the  initial 
step,  taken  prior  to  any  other  processing,  and  has  been 
intensely  studied  over  the  years.  Considering  a  scenario  of 
using  networked  radar-like  sensors  to  detect  the  possible 
presence  of  an  illuminated  target  at  a  specific  geographical 
location,  our  previous  work  [1],  [2]  addressed  the  problem 
of  energy-efficient  routing  for  signal  detection  under  the 
Neyman-Pearson  criterion.  We  assumed  a  powerful  fusion 
center  which  has  the  knowledge  of  the  geographical  location 
of  each  sensor,  determines  the  location  to  be  probed,  and 
more  importantly,  jointly  optimizes  detection  and  routing  in  a 
centralized  manner  by  precomputing  the  routes  as  a  function 
of  the  geographical  location  to  be  monitored.  However,  as 
the  size  of  a  sensor  network  grows  large,  which  is  likely  to 
happen  due  to  sensors’  low  cost  and  small  size  as  well  as  their 
ease  for  deployment,  it  may  appear  demanding  if  such  premise 
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Fig.  1:  System  architecture  of  a  large-scale  wireless  sensor 
network  for  detecting  scattering  signals.  The  6-sided  star 
represents  the  scattering  source  to  be  detected. 

of  a  very  powerful  central  unit  continues  to  be  employed. 
Thus  for  this  problem  of  practical  interest,  it  is  desirable  to 
develop  appropriate  schemes  which  are  scalable  and  able  to 
reduce  the  stringent  requirement  on  the  processing  capability 
of  the  fusion  center.  These  schemes,  in  the  meantime,  should 
optimize  the  detection  performance  of  the  system,  and  should 
address  as  well  the  inherent  design  challenges  of  WSNs, 
including  their  dependence  on  finite  battery  energy. 

In  this  paper,  our  focus  lies  in  determining  the  presence  of 
fluctuating  targets  using  distributed  radar-like  sensors.  Moti¬ 
vated  to  address  the  above-mentioned  challenges  and  to  obtain 
a  useful  and  practically  viable  wireless  sensor  networking 
solution,  we  propose  a  novel  distributed  and  energy-efficient 
framework  for  the  problem  of  signal  detection  in  large-scale 
WSNs  through  active  sensing,  which  to  the  best  of  our  knowl¬ 
edge,  is  not  available  in  the  literature.  This  framework  is  based 
on  a  clustering  infrastructure.  In  other  words,  the  network 
is  divided  into  small  clusters,  and  each  cluster  will  elect  a 
cluster  head.  This  is  illustrated  in  Fig.  1  where  inter-cluster 
communications  are  performed  among  the  cluster  head  nodes 
using  multi-hop  routing,  while  intra-cluster  communications 
are  normally  between  nodes  residing  inside  the  cluster  and  the 
cluster  head.  The  signal  detection  task  is  carried  out  by  the 
non-cluster  head  nodes  inside  each  cluster.  Each  cluster  head, 
acting  as  a  local  control  center,  will  make  a  binary  decision 
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about  the  presence  of  a  target  at  a  specified  location  given 
the  information  it  collects,  and  will  report  to  the  fusion  center 
via  multi-hop  routing  only  when  the  hypothesis  test  is  true. 
This  reduces  the  data  set  that  needs  to  be  transmitted  to  the 
fusion  center,  and  thus  alleviates  the  storage  and  processing 
requirement  on  the  fusion  center. 

The  system  design  involves  several  intertwined  network 
layers,  including  the  physical  layer,  the  media  access  control 
(MAC)  layer,  and  the  routing  layer.  In  fact,  apart  from 
modeling  this  application-specific  sensor  network,  a  major 
contribution  of  our  work  is  that  it  provides  and  integrates 
distributed  solutions  targeted  at  different  layers.  Moreover, 
through  exploiting  a  tight  coupling  between  the  physical  and 
the  upper  layers  (the  MAC  layer  in  particular),  our  framework 
can  attain  appealing  tradeoffs  between  the  energy  efficiency 
and  the  detection  reliability.  The  contributions  of  this  paper 
are  summarized  as  follows: 

•  Related  to  the  physical  layer  is  a  distributed  signal 
processing  scheme  that  includes  a  thorough  analysis  of 
the  Neyman-Pearson  detection  of  fluctuating  narrowband 
radar  signals  from  distributed  sensors,  and  a  proposal  of  a 
channel-capacity-like  detection  performance  metric  based 
on  the  Kullback-Leibler  distance. 

•  By  exploiting  the  statistical  and  instantaneous  informa¬ 
tion  obtained  via  in-network  processing,  we  propose  a 
new  scheduling  protocol  which  allows  each  non-cluster 
head  node  to  make  an  autonomous  decision  about  when 
to  transmit,  and  strikes  a  balance  between  the  detection 
performance  and  the  energy  savings. 

•  To  regulate  the  inter-cluster  communications,  we  present 
a  distributed  routing  scheme  which  allows  each  cluster 
head  to  construct  its  own  routing  decisions  with  its 
available  information,  and  from  consultation  with  imme¬ 
diate  neighbors,  to  minimize  the  overall  energy  cost  for 
information  transmission  to  the  fusion  center. 

The  remainder  of  this  paper  is  organized  as  follows. 
In  Section  II,  we  introduce  the  system  architecture  and  the 
signal  model.  In  Section  III,  we  focus  on  the  Neyman-Pearson 
detection  of  fluctuating  radar  signals,  and  propose  to  use 
the  Kullback-Leibler  distance  as  the  performance  metric.  In 
Section  IV,  we  focus  on  the  distributed  scheduling  design, 
and  propose  a  simple  scheduling  protocol  and  its  distributed 
implementation.  In  Section  V,  we  study  the  multi-hop  routing 
among  cluster  heads,  and  present  a  distributed  routing  algo¬ 
rithm.  Simulation  examples  are  given  in  Section  VI.  Finally, 
Section  VII  concludes  this  paper. 

II.  System  and  Signal  Model 
A.  System  Architecture 

We  consider  a  sensor  network  which  consists  of  a  large 
amount  of  sensors  that  are  distributed  over  a  geographical 
field,  and  a  fusion  center  which  can  be  inside  or  outside 
the  field.  For  simplicity  we  assume  the  clusters  are  formed 
and  do  not  change  dynamically,  while  noting  that  the  cluster 
head  role  can  rotate  among  nodes  to  ensure  energy  fairness. 
Those  clusters  can  be  formed  in  several  different  ways,  one 
of  which,  for  example,  is  the  low-energy  adaptive  clustering 
hierarchy  in  [3],  The  specific  geographical  area  covered  by 


each  cluster  which  is  relatively  small  in  size,  is  further  divided 
into  multiple  range  resolution  cells1  (cf.  Fig.  1).  Each  range 
resolution  cell  is  probed  sequentially  by  sensors  inside  a 
cluster,  using  radar  pulses  and  according  to  some  desired 
schedule. 

We  assume  each  cluster  head  has  the  following  knowl¬ 
edge:  the  locations  of  its  immediate  neighbors  in  the  routing, 
the  range  resolution  cells  which  compose  the  area  covered  by 
its  cluster,  and  the  members  which  belong  to  the  cluster  as 
well  as  each  member's  physical  location.  It,  however,  does 
not  know  the  energy  level  of  each  sensor  inside  the  cluster. 
For  each  non-cluster  head  node,  we  assume  it  knows  its  own 
location  and  the  location  of  its  cluster  head.  It  also  knows  the 
order  of  cell  probing  as  well  as  the  location  of  each  specific 
cell  to  probe.  However,  it  does  not  have  the  knowledge  of 
the  location  and  energy  level  of  each  of  its  neighbors.  Thus, 
cooperation  for  information  transfer  to  the  cluster  head  is 
not  pursued  here,  although  it  could  be  incorporated  into  our 
framework.  We  also  assume  that  the  energy  consumed  for 
sensing  is  in  general,  relatively  small  as  compared  to  that 
dissipated  in  data  communication. 

B.  Signal  Model 

Suppose  sensor  i  is  chosen  to  perform  the  sensing  task 
according  to  a  certain  criterion,  e.g.,  if  its  residual  energy  is 
beyond  a  certain  level  and  its  distance  to  the  target  location  is 
within  a  certain  range.  Then  the  illuminating  signal  sent  from 
sensor  i  can  be  expressed  as 

Si(t)  =  fi(t)  sin  [wit  +  fa ]  (1) 

where  Wi  denotes  the  carrier  frequency,  fi(t)  corresponds  to 
the  signal  envelope,  and  fa  corresponds  to  a  phase  angle.  We 
assume  the  transmitted  signals  are  both  amplitude-  and  phase- 
modulated,  but  Wi  varies  from  sensor  to  sensor  to  reduce  the 
signal  interference.  We  model  the  target  as  a  point  scatterer, 
so  the  reflected  waveform  will  be  an  attenuated  and  varied 
version  of  the  transmitted  waveform.  We  consider  the  situation 
that  both  the  amplitude  and  phase  of  the  received  signal  vary. 
This  happens  when  the  target’s  aspect  or  effective  radar  cross 
section  changes  from  pulse  to  pulse.  In  the  communication 
context,  this  situation  is  usually  referred  to  as  fading.  We 
assume  echo  signals  arrive  at  each  sensor  node  with  indepen¬ 
dent  fading.  Under  Hi  (target  present),  we  model  the  received 
signal  as 

n(t)  =  Vifi(t  -  n )  sin[uij(f  -  n)  +  fa  +  Qi]  +  (2) 

where  r*  is  the  delay,  rii(t)  denotes  the  independently  and 
identically  distributed  Gaussian  noise  with  variance  o v, 
and  Oi  are  independent  random  variables  which  correspond 
to  the  amplitude  variation  and  phase  noise,  respectively.  We 
assume  i>j  has  a  Rayleigh  density  with  parameter  af,.  as 
follows 


’For  the  given  sensor  processing,  it  is  statistically  very  unlikely  to  distin- 
guish  two  or  more  targets  that  are  located  within  the  same  range  resolution 
cell  [4]. 
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We  assume  9i  is  uniformly  distributed  over  [0,  27r],  For  sim¬ 
plicity,  (2)  does  not  account  for  the  clutter  and  Doppler  effects 
that  might  also  occur,  although  these  can  be  incorporated  in 
standard  ways.  Further,  we  also  assume  that  the  path  loss 
attenuation  effect  is  embedded  in  Vi  (thus  erf),  so  that  we  do 
not  need  to  explicitly  deal  with  the  distance  between  the  target 
and  sensor  i.  In  practice,  we  can  use  neighboring  range  cells 
to  estimate  the  noise/clutter  power,  and  the  known  distance 
traveled  by  the  probing  radar  signals  to  estimate  the  relative 
propagation  losses. 

As  the  probe  location  is  known  to  sensor  i,  there  would 
be  no  difference  if  we  simply  let  t*  =  0.  Further,  we  consider 
sampled  signals  in  the  sequel  for  convenience.  Let  T,  be  the 
sampling  interval,  and  K  be  the  number  of  samples  available 
at  sensor  i.  Then  under  hypotheses  Hi  and  //() ,  the  sampled 
waveform  for  k  =  •  ,  K  can  be  written  as 

<h\  -  /  sin[(fc  -  l)wiTi  +  fa  +  9i }  +  rii(k),  Hi 

TiW  ~  \  m(k),  H0 

where  fi(k)  =  fi((k  -  1  )fa),  m(k)  =  Hi((k  -  1)7)),  k  = 
l,---  ,  K  are  the  sampled  waveform  fi(t),  and  noise  rii(t), 
respectively. 


III.  Neyman-Pearson  Detection  of  Fluctuating 
Radar  Signals 

A.  The  Neyman-Pearson  Detector 

By  definition,  the  local  likelihood  ratio  at  sensoi^?'  is 
given  by  (3)  at  the  bottom  of  this  page,  where  ff  = 
(1  /K)  J2k=i  LI  and  if  are  respectively  defined  as 

K 

Lj  =  ^ n(k)fi(k)  cos[(fc  -  1 .)wiTi  +  fa], 

k=d 

K 

LSi  =  ^2  ri(k)fi(k)  sin[(fc  -  1  )iuiTl  +  fa]. 

k—1 


It  does  not  appear  straightforward  to  compute  or  simplify  (3). 
But  fortunately,  it  can  be  shown  that  L\  and  ff  are  sufficient 
statistics  [5].  In  fact,  they  are  also  jointly  Gaussian  under 
both  hypotheses.  For  each  sensor  i,  given  Ho,  L 1  and  L\ 
are  independent  Gaussian  random  variables  and  distributed 
according  to  J\f(0 ,  cr^),  where  =  Kffaf./2.  Similarly, 
under  Hi,  LI  and  L\  are  also  independent  Gaussian  random 
variables,  but  they  are  distributed  according  to  A/”(0,cr^), 
where  of,  =  K  ff  of.  /  2  +  (Kff)2  of  ./A.  Therefore,  the  log- 
likelihood  ratio  at  each  sensor  i  can  be  computed  in  terms  of 
and  Lf,  and  can  be  written  as 


A,-  =  In 


P(Lct,L°\Hi) 

p(lili\h0) 


=  lna0i  -incri 


L2[(i?)2 


where  a2  =  ( a0  2  -  <7u2)/2. 

We  assume  there  are  N  sensors  inside  a  cluster  that 


perform  the  sensing  task.  Additionally,  we  will  use  the  fol¬ 
lowing  conditional  independence  assumption  throughout  the 


paper:  the  observations  of  all  sensors  are  statistically  inde¬ 
pendent,  conditioned  on  the  binary  hypothesis.  Note  that  this 
assumption  is  very  common  in  distributed  detection  problems. 
Thus  L^,  L\,  i  =  1,  ■■■,  N,  are  all  independent  distributed 
Gaussian  variables  under  both  hypotheses.  Then,  the  overall 
log-likelihood  ratio  can  be  calculated  through 

N  N  N 

A  =  E  =  E(Wo,  -  In  4)  +  E  4(£?)2  +  (Li)2]. 

i— 1  i= 1  i= 1 

Incorporating  the  threshold  77,  the  Neyman-Pearson  test  which 
according  to  the  Neyman-Pearson  lemma  [5]  is  a  likelihood 
ratio  test,  can  be  written  as 

N  Hi  N 

Eat[(^i)2  +  (Lsi)2}  I  V- E(lno0i  -incrfj  (4) 

1  1 


where  we  have  defined  the  test  statistic  l  and  the  new  threshold 
7/.  A  detailed  performance  analysis  of  the  above  Neyman- 
Pearson  detector  can  be  found  in  [6,  Appendix  B],  which 
indicates  that  obtaining  a  closed-form  relation  between  the 
probability  of  detection  Pd  and  the  probability  of  false  alarm 
Pp  can  be  a  rather  formidable  task.  Further,  such  performance 
analysis  unfortunately  does  not  immediately  render  much 
insight  into  the  design  of  the  whole  system.  Therefore,  next 
we  propose  to  employ  an  alternative  measure  to  characterize 
the  detection  performance  of  the  Neyman-Pearson  detector. 


B.  The  Kullhack-Leihler  Distance 

We  consider  the  Kullback-Leibler  distance  [7]  as  an 
alternative  performance  index.  The  Kullback-Leibler  distance 
is  the  expected  value  of  the  log-likelihood  ratio  taken  over  the 
density  of  one  of  the  hypotheses.  It  is  one  of  the  most  fre¬ 
quently  used  information-theoretic  distance  measures,  and  is 
related  to  the  Neyman-Pearson  detector’s  performance  through 
Stein’s  lemma  [8,  p.  93].  We  state  this  result  in  the  following 
without  proof: 

Theorem  1  (Stein’s  Lemma):  Consider  a  hypothesis  test 
where  we  want  to  distinguish  between  the  null  probability 
density  po(y)  and  the  alternative  probability  density  pi(y). 
Suppose  we  have  collected  independent  observations  yi,  7/2, 
•••,  t/at.  Assume  D(po\\pi)  <  00,  where  D(po\\pi)  denotes 
the  Kullback-Leibler  distance  or  the  relative  entropy  of  pi  with 
respect  to  true  distribution  po,  and  is  defined  to  be 

L>(po\\pi)  =  [ Po(y)  In  dy. 

J  pi(y ) 

Let  An  be  an  acceptance  region  for  hypothesis  Ho,  and  ACN  be 
the  acceptance  region  for  hypothesis  Hi.  Let  /3jv  =  Pf1  (An) 
denote  the  miss  error  probability,  i.e.,  Prfchoose  Ho\Hi  is 
true)  and  ajv  =  -Po^^at)  denote  the  false  alarm  probability, 
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i.e.,  Pr(choose  Hi\Hq  is  true).  For  0  <  e  <  1/2,  we  define 


Then 


Pn  =  min  Pn- 

aN<e 


iim  Jim  Tf  ^-Pn  =  ~D(p0\\p1). 

6—>0  iV— »oo  iv 


□ 


Stein’s  lemma  indicates  that  the  Kullback-Leibler  dis¬ 
tance  is  the  exponential  rate  at  which  the  error  probabilities  de¬ 
cay  for  a  Neyman-Pearson  detector.  This  result  is  independent 
of  the  false  alarm  level  (i.e.,  upper  bound  constraint  on  Pp)  as 
long  as  such  level  is  neither  0  nor  1  [8,  p.  93].  The  Kullback- 
Leibler  distance  is  additive,  i.e.,  the  distance  between  two  joint 
distributions  of  statistically  independent  distributed  random 
variables  is  equal  to  the  sum  of  the  marginal  distance  [7,  p.12]. 
Therefore,  for  the  problem  that  we  are  interested  in  where  all 
the  Gaussian  random  variables,  Lf  Lf,  i  =  1,  •  •  •  ,  N  are 
independent,  assuming  sufficiently  large  N,  the  probability  of 
miss  is  given  by 


JV 


Pm  «  g{N)  •  exp  \  -  £  Dfp[L\,  Lt\H0)\\p(Ll  L?|tfr)) 


(5) 


(6) 


where  g(N)  is  a  positive-valued  function  that  satisfies 

lim  —  In  g(iV)  =  0. 

N — »oo  N 

Eq.  (6)  states  that  the  logarithm  of  g(N)  increases  more  slowly 
than  a  linear  function.  Because  g(N)  changes  slowly,  the 
asymptotic  variation  of  Pm  with  respect  to  N  is  captured  by 
the  exponential  term,  and  thus  by  the  sum  of  each  individual 
Kullback-Leibler  distance.  For  convenience,  we  use  D  to 
denote  the  sum  term  in  (5),  and  use  Di  for  short  for  each 
term  in  the  summation.  Then,  we  have  the  following  result: 

Lemma  1:  Define  snr,  =  Kffaf.  /2<r2.  as  the  nominal 
signal-to-noise  ratio  at  each  sensor  i.  Then  the  Kullback- 
Leibler  distance  at  each  sensor  i  is  given  by: 

-l 


Di  =  In  (1  +  snr,)  -  1  + 


1 


snr,; 


(7) 


Proof:  Based  on  the  definition,  we  compute  the  Kullback- 
Leibler  distance  Di  as  follows: 


Di  = 


p(LCi,  Ll\H0)  In 


'  —  oo  J  —  oo 
2 


P(LCi,M\Hi) 

=  In  at,  -  ln4  -  a/E{(L?)2_  +  (Lst)2\H0} 

Kff<]  Kffal 


=  In 


1  + 


2*1. 


Kffaf  +  2v* 


Applying  snr;  =  KffafJ^af.,  we  obtain  (7).  ■ 

Note  that  snr;  is  indeed  related  to  the  true  signal-to- 
noise  ratio,  because  the  average  energy  of  the  received  signal 
(excluding  the  noise)  is  given  by 

E  M(fc)  sinKfc  -  l>iTi  +  &  +  ^02|  =  KH<- 

We  present  below  some  properties  of  Di  without  proof. 
Lemma  2:  Di  has  the  following  properties: 


1)  Di  is  strictly  positive. 

2)  Di  is  strictly  increasing  with  respect  to  snr,;. 

3)  Di  is  a  strictly  concave  function  of  snr,  for  snr,;  >  1. 

Lemma  1  and  2,  along  with  Theorem  1,  yield  the  follow¬ 
ing  result: 

Proposition  1:  The  contribution  of  each  sensor  i  to  the 
overall  detection  performance  of  the  Neyman-Pearson  detector 
is  quantified  by  its  Kullback-Leibler  distance  I),. 

Strictly  speaking,  the  Kullback-Leibler  distance  only  de¬ 
termines  asymptotic  performance.  In  other  words,  the  relation 
between  the  Kullback-Leibler  distance  and  Pm,  as  manifested 
in  (5),  does  not  directly  lead  to  a  numeric  estimate  of  Pm- 
Nevertheless,  the  variation  of  Pm  with  respect  to  D  (thus  Dp 
can  be  determined  through  (5)  and  with  assurance  that  the 
accuracy  of  the  predicted  variation  will  improve  as  the  number 
of  independent  observations,  i.e.,  N,  increases.  Therefore,  Di 
lends  itself  to  be  an  amenable  measure  to  determining  a  sen¬ 
sor’s  importance  in  the  Neyman-Pearson  detection.  The  larger 
the  value  of  Di,  the  more  important  a  sensor’s  observations 
for  the  test. 

Remark  1 :  For  relatively  high  snr,,  (7)  reduces  to 

Di  «  In  (1  +  snr,)  -  1.  (8) 

The  first  term  in  (8)  is  well  recognized2,  which  in  com¬ 
munications  denotes  the  channel  capacity  of  the  complex 
Gaussian  channel.  For  this  specific  problem  of  interest,  it  can 
be  interpreted  as  the  amount  of  information  contained  in  the 
observations  of  sensor  i,  or  the  uncertainty  that  is  reduced 
about  the  target  presence  through  collecting  information  from 
sensor  i.  The  second  term  is  a  constant  and  is  the  same  for 
each  different  sensor  under  high  snr,;. 

IV.  Distributed  Scheduling  Design 
A.  Problem  Statement 

After  probing  a  specific  resolution  cell  and  capturing  the 
echo  signal,  the  non-cluster  head  node  which  is  involved  in 
sensing  (hereafter  simply  referred  to  as  sensor  or  sensor  node) 
will  perform  some  form  of  computation  and  get  ready  to  send 
the  post-processed  data  to  the  cluster  head.  A  fundamental 
objective  of  the  scheduling  design  is  to  determine  the  sequence 
of  each  sensor’s  transmission.  But  our  framework  further  seeks 
some  scheduling  scheme  that  can  strike  an  appropriate  balance 
between  the  detection  performance  and  the  energy  savings  in 
a  distributed  manner.  For  convenience,  we  assume  that  all 
the  sensors  in  a  cluster  are  synchronized  at  the  beginning 
of  their  transmissions.  This  can  be  achieved  by  having  the 
cluster  head  send  a  beacon  or  synchronization  message  to  the 
sensors.  During  the  process  of  transmissions,  however,  we  do 
not  assume  that  these  sensors  are  still  synchronized. 

We  firstly  identify  the  parameters  that  should  be  exploited 
in  the  scheduling  design.  As  for  the  energy  efficiency,  the 
residual  energy  of  each  sensor  i,  denoted  as  tpi,  which  equals 
the  sensor’s  current  energy  minus  the  energy  required  to 
send  its  data  to  the  cluster  head  (i.e.,  ep,  serves  as  a  good 

-A  similar  capacity-like  metric  based  on  the  Chemoff  information  was 
obtained  in  [9],  but  for  a  problem  of  detecting  a  correlated  Gaussian  field 
which  is  completely  different  from  and  not  applicable  to  the  radar  detection 
problem. 
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measure  and  directly  reflects  each  sensor  node’s  lifetime  after 
data  transmission.  Di,  as  defined  in  (7),  is  another  parameter 
we  should  take  into  account,  as  it  indicates  the  importance 
of  each  sensor’s  data  in  the  overall  detection  performance 
(cf.  Proposition  1).  In  addition,  the  amplitude  of  a  sensor’s 
local  log-likelihood  ratio  | |  is  one  other  parameter  that 
merits  our  attention.  In  fact,  as  indicated  in  [10],  |A;|  reflects 
the  extremeness  or  informativeness  of  a  sensor’s  observation, 
and  can  be  exploited  to  attain  a  reduced  average  number  of 
transmissions. 

Remark  2:  It  is  worthwhile  to  make  a  remark  about 
Di  and  |A*|.  Di,  as  defined  in  (7),  only  depends  on  a 
sensor's  second-order  statistics,  and  is  noise  free.  It  reflects 
the  statistical  tendency  of  a  sensor’s  contribution  to  the  overall 
detection  performance  in  the  Neyman-Pearson  sense.  |Aj|,  on 
the  other  hand,  contains  information  about  a  sensor’s  real¬ 
time  observation  as  well  as  the  instantaneous  noise.  It  reflects 
a  sensor’s  momentary  importance  to  the  decision  making  for 
a  particular  hypothesis  test.  |A*|  is  related  to  Di  in  a  statistical 
average  sense,  even  monotonic  ally.  But  a  specific  large  |A*| 
does  not  necessarily  suggest  a  large  D(,  and  vice  versa. 

Thus,  in  terms  of  the  system-level  performance,  our 
design  objectives  are  :  1)  to  reduce  the  average  number  of 
transmissions  for  reaching  a  decision;  2)  to  make  sure  the 
decision  has  a  desired  level  of  confidence.  The  first  property  is 
related  to  the  log-likelihood  ratio  at  each  sensor  i.  The  second 
property,  if  cast  mathematically,  can  be 

N 

^  '  Pi  Di  Dq, 

i=l 

where  pi  £  {0, 1}  is  an  indicator  variable  which  indicates 
whether  sensor  i  elects  to  transmit,  and  Dq  is  the  minimum  ac¬ 
ceptable  overall  Kullback-Leibler  distance  which  corresponds 
to  a  certain  quantity  of  the  detection  probability  given  a  fixed 
value  of  false  alarm  probability  Pp.  Meanwhile,  considering 
each  sensor  may  act  selfish  in  order  to  last  longer,  a  desirable 
scheduling  design  should  extend  each  sensor's  lifetime  as  well. 
After  all,  rotating  the  cluster  head  position  is  necessary  for 
maintaining  the  clustering  architecture  (e.g.,  [3]),  and  the  one 
that  becomes  the  new  cluster  head  is  usually  the  one  which 
has  the  largest  residual  energy.  Hence,  the  manifestation  of 
energy  efficiency  at  the  per-sensor  level  can  be  expressed  as 

ma x  i  =  1,  •  •  •  ,  N. 

B.  A  Simple  Distributed  Scheduling  Scheme 

There  exist  some  distributed  MAC  or  scheduling  pro¬ 
tocols  (see  [11]  and  [12]  for  a  survey),  but  they  are  not 
applicable  to  this  problem  because,  among  other  reasons,  de¬ 
tection  performance  is  not  taken  into  account.  Some  schemes 
named  opportunistic  transmission  scheduling,  e.g.,  in  [13]- 
[15],  exploit  the  channel  state  information  in  random  access  to 
achieve  energy  efficiency.  However,  in  this  paper,  the  channel 
state  information  is  not  a  major  concern.  To  circumvent 
this  problem,  based  on  our  previous  work  on  the  energy- 
performance  tradeoff  [2]  and  also  inspired  by  the  ordered 
transmission  approach  [10],  we  introduce  a  priority  concept 
named  the  energy-performance  index,  and  let  sensors  with 
higher  index  values  to  transmit  earlier. 


The  energy-performance  index  for  sensor  i  is  defined  to 
be  5i  =  Pi (Di,  |Aj|,  t/ij),  where  PL  is  a  mapping  function.  More 
specifically,  PL  is  a  real-valued  nonnegative  function  of  I),, 
A,  |,  il>i,  and  in  general  is  increasing  with  respect  to  each  of 
these  parameters.  PL  does  not  need  to  be  a  continuous  function, 
it  can  be  piecewise  for  example,  and  can  assume  different 
forms.  For  instance,  if  we  define  hi(ipi)  as  a  battery  cost 
function  which  is  monotonic  decreasing  with  respect  to  ipi  and 
captures  the  reluctance  of  sensor  i  to  transmit,  then  adopting 
the  idea  of  detection-probability-to-energy  ratio  in  [2],  we  can 
define  the  energy-performance  index  as 

f  _  h2(Di,\ Ai|) 

*  hMi)  ’ 

where  li2(Di,  |A;|)  denotes  a  certain  form  of  combination 
of  Di  and  |A;|,  and  is  monotonically  nondecreasing  with 
regard  to  either  parameter.  Of  course,  the  simplest  form  of 
this  PL  function  is  probably  a  linear  combination  of  all  three 
parameters,  i.e., 

Si  =  £l  Di  +  £2  I  Ai|  +  £3 

where  £1,  £2,  and  £3  are  scaling  coefficients  that  resemble 
Lagrange  multipliers.  In  particular,  when  £1  =  £3  =  0,  this 
corresponds  to  the  ordered  transmission  approach  in  [10], 
where  sensors  with  more  extreme  observations  have  higher 
priority.  If  simply  letting  £1  =  £2  =  0,  it  represents  the 
conservative  energy-dominating  approach  in  which  sensors 
with  more  residual  energy  are  favored.  Thus,  a  desired  tradeoff 
among  these  parameters  Di,  |A$|,  and  can  be  attained  by 
judiciously  choosing  the  mapping  function  PL,  and/or  tuning 
the  coefficients  for  these  parameters.  Note  that  the  properties 
of  Di,  as  stated  in  Lemma  2,  can  also  be  exploited  to  facilitate 
the  design  of  Pi. 

We  further  introduce  a  parameter  named  defer  time,  like 
that  in  [10]  and  [15],  to  determine  when  each  sensor  transmits. 
Defer  time  is  denoted  by  J(8i)  and  is  a  function  of  each 
sensor’s  index  value  Si.  It  is  basically  the  time  that  each  sensor 
has  to  wait  before  transmitting  their  information  to  the  cluster 
head.  It  also  resembles  the  interframe  spacing  time  interval3  in 
IEEE  802.11  CSMA/CA  protocol  for  which  each  node  has  to 
wait  before  transmitting  even  though  it  senses  that  the  medium 
is  idle.  J  should  be  a  non-increasing  function  of  the  index 
value  Si,  such  that  sensors  with  higher  index  values  will  wait 
less  time  before  accessing  the  medium.  J  can  also  take  a 
variety  of  forms.  In  fact,  [10]  adopts  a  very  simple  form  similar 
to  J(Si)  =  1  /Si.  However,  to  reduce  the  chance  of  collisions, 
J  should  be  judiciously  designed  along  with  PL,  so  that  the 
resulting  defer  time  for  different  sensors  is  well  separated. 

In  the  distributed  implementation,  the  functions  of  Pi 
and  J  are  assumed  to  be  available  at  each  sensor,  so  that 
each  sensor  can  compute  its  index  value  Si  and  its  defer 
time  J(5i)  given  its  own  local  information.  Each  sensor  can 
thus  make  an  autonomous  decision  about  when  to  transmit, 
without  consultation  with  any  of  its  neighbors  or  the  cluster 
head.  When  each  sensor  sends,  it  transmits  Di  and  A,,  along 
with  other  necessary  information  for  network  maintenance. 

Assigning  different  interframe  spacing  to  different  traffic  flows  or  users 
based  on  their  priorities  was  considered  in  [16],  in  order  to  introduce  service 
differentiation  into  IEEE  802. 1 1  MAC  sublayer. 
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Fig.  2:  Illustration  of  the  distributed  implementation.  Trans¬ 
missions  will  be  stopped  at  point  (a)  when  a  decision  can  be 
made  with  a  certain  level  of  conbdence,  or  at  point  (b)  when 
the  time  for  a  single  test  expires. 

The  stopping  rule  should  be:  1)  when  a  decision  can  be  made 
with  a  desired  conbdence  level  (i.e.,  D  >  Do);  or  2)  when 
the  time  for  a  particular  test  expires.  A  simple  example  of 
this  distributed  scheduling  approach  is  illustrated  in  Fig.  2.  In 
either  case,  the  cluster  head  can  broadcast  one  beacon  signal 
to  all  these  sensors  to  stop  further  transmissions. 

V.  Distributed  Routing  Design 
A.  Problem  Statement 

Each  cluster  head  will  make  a  binary  decision  based 
on  the  data  it  retrieves  from  the  sensors,  and  it  will  report 
to  the  fusion  center,  once  it  identibes  a  target  at  a  specibc 
range  resolution  cell.  In  addition  to  functioning  in  the  event- 
driven  mode,  each  cluster  head  also  needs  to  periodically 
send  status  information  to  the  fusion  center  for  the  purpose 
of  network  maintenance,  and  serves  as  a  relay  node  for  its 
neighboring  nodes.  Therefore,  it  appears  reasonable  to  assume 
a  continuous  data  delivery  model  for  each  cluster  head.  In 
the  routing  design,  we  are  also  concerned  with  the  energy 
efficiency.  For  simplicity  we  consider  the  energy  dissipated 
at  each  cluster  head  instead  of  the  residual  energy,  although 
residual  energy  can  be  easily  incorporated,  like  that  in  [2].  In 
addition,  it  is  worth  noting  that  routing  in  this  scenario  is  quite 
different  from  that  in  [2],  because  we  are  dealing  with  multiple 
clusters  and  there  could  be  possibly  several  targets  appearing 
in  different  clusters  at  the  same  time.  Hence,  we  assume  each 
cluster  head  has  a  routing  table  for  several  possible  paths  that 
lead  to  the  fusion  center,  and  it  can  choose  which  path(s)  to 
use  by  varying  the  traffic  flow  over  that  path. 

Let  Q  =  (A f,  A)  denote  a  bnite  network  debned  by  a 
set  Af  of  nodes  and  a  set  A  of  directed  arcs.  In  particular, 
we  denote  the  fusion  center  as  node  q,  thus  the  set  of  cluster 
head  nodes  is  given  by  A f\q.  We  denote  by  (i,j)  the  link 
from  node  i  to  node  j.  We  distinguish  link  (i,j)  from  (j,  i), 
and  simply  assume  that  if  one  exists  the  other  does  also.  We 
assume  the  network  is  connected  in  the  sense  that  for  any  two 
nodes  i  and  j ,  there  is  a  directed  path  from  i  and  j ,  which 


is  a  sequence  of  non-repeated  nodes  and  connecting  arcs.  We 
assume  that  the  traffic  load  generated  at  each  cluster  head  is 
moderate.  Thus,  flow  control  or  congestion  control  schemes 
will  not  be  considered  in  this  paper. 

Let  bi  >  0  be  the  expected  traffic  in  bits/sec  generated  at 
each  cluster  head  i  which  consists  of  information-bearing  data 
and  status  information.  We  deflne  bq  =  0.  For  link  let 

eij  be  the  energy  consumption  corresponding  to  sending  one 
unit  traffic  (per  bit/sec)  over  it,  and  Cy  be  the  transmission 
capacity4  of  link  ( i,j ).  Thus,  the  routing  objective  is  to  divide 
the  traffic  among  the  paths  from  each  cluster  head  to  the 
fusion  center  in  a  way  that  the  resulting  total  link  flow  patterns 
minimizes  the  energy  consumption.  The  routing  problem  can 
be  then  formulated  as  follows: 

P.l  :  min  y~"^  e ij  ■  F^ 

(hi) 

s.t.  yy  —  yy  Fji  =  bi 

{j-UA^A} 

0  <  Fij  <  Cij,  V(i,  j)  £  A 

Fqi  =  0,  Vi  G  M\q 

where  Ftj  denotes  the  amount  of  flow  over  link  (i.j)  and  is 
the  decision  variable. 

P.l  is  a  multicommodity  flow  problem,  and  can  be  solved 
relatively  easily  in  a  centralized  manner  with  some  linear 
programming  techniques.  However,  in  general,  it  is  nontrivial 
to  solve  P.l  directly  in  a  distributed  manner5.  As  a  result, 
we  start  by  modifying  P.  1  into  a  slightly  different  formulation 
which  can  be  more  amenable  to  distributed  computation.  Our 
brst  task  is  to  eliminate  the  link  capacity  constraint  FtJ  <  C^. 
One  approach  to  achieve  this  is  to  apply  the  penalty  function 
method.  We  hereby  introduce  a  new  cost  function  Uij  which 
has  the  following  properties: 

1)  Uij  is  a  real-valued  function  only  of  Ft]  and  is  debned 
on  an  interval  [0,  Cij). 

2)  Uij  is  nonnegative  continuous  increasing  with  respect  to 
F^,  and  has  continuous  brst  and  second  derivatives. 

3)  is  strictly  convex. 

4)  Uij  grows  at  an  inbnite  rate  as  Ftj  approaches  the 
boundaries  of  the  domain  of  Uij,  i.e.,  Uij(Fij)  — >  oo 
as  F^  >  Cij . 

A  possible  choice  of  the  cost  function  Uij  could  be,  for 
example, 

Uij  ( Fij )  =  ■  F^  +  —  ——— ,  (9) 

Gjj  rij 

where  a  is  a  positive  scaling  parameter.  Applying  this  new 
cost  function  Uij,  P.l  becomes  a  convex  cost  network  flow 
problem  without  arc  flow  bounds,  which  hereby  is  denoted 
as  problem  P.2,  but  for  brevity  is  not  explicitly  given  here.  It 
can  be  veribed  that  if  Problem  P.2  has  at  least  one  feasible 
solution,  it  will  have  a  unique  optimal  solution  [6]. 

Our  next  task  is  to  debne  a  new  set  of  decision  variables 

4The  capacity  of  a  wireless  link  is  interrelated  with  other  wireless  links  in 
the  neighborhood,  and  is  a  function  of  transmitter  powers  (its  own  and  nodes 
in  the  vicinity)  and  the  channel  conditions. 

'Ill ere  exist  some  works  which  attempt  to  solve  this  type  of  problem  in  a 
distributed  way  (e.g.,  [17]),  but  they  either  do  not  meet  our  design  need,  or 
are  not  amenable  to  implementation. 
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Fig.  3:  A  simple  example  of  several  connected  cluster  heads 
whereby  for  node  i,  we  have  at  =  bi  +  Fji,  pik  =  Fik/cti, 
Pirn  Fim  /  Cti- 


Algorithm  1  Information  exchange  at  each  cluster  head  i 

1:  Receive  the  value  of  dU/dbk  from  each  of  its  downstream 
neighbors  j  ^  q  ( pij  >  0). 

2:  If  any  downstream  neighbor  also  sends  a  special  blocking  tag, 
put  that  neighbor  node  in  a  set  B(i). 

3:  Calculate  the  value  dU /dbi  from 


dU_ 

dbi 


E 


Pij 


uij  + 


i  £  Af\q. 


4:  For  each  downstream  neighbor  j  A  q,  determine  if  the  following 
conditions  are  satisfied: 

„  J  dU  ^  dU 
PH  >  0  and  oTT  —  or-  ’ 


pij  > 


a,i 


dbi  dbj 
TT>  dU  dU 

u-  +  w-~ 


dbi 


that  are  suitable  for  distributed  iterative  computation.  Let  a, 
be  the  total  traffic  or  simply  the  node  flow  at  cluster  head  i.  cii 
includes  both  bi  and  the  traffic  from  other  cluster  heads  that 
traverses  through  i,  i.e., 

at=bi+  E  Fjii  i G  -N\q-  (10) 

Eq.  (10)  also  implicitly  states  the  conservation  of  flow  at  each 
cluster  head,  i.e.,  the  expected  traffic  into  a  cluster  head  is 
equal  to  the  expected  traffic  out  of  it.  For  a,  t  0,  we  let 
0  <  Pij  <  1  be  the  fraction  of  the  node  flow  a;  that  is  routed 
over  link  (i,j),  i.e., 

Pij  =  —,  i  tAf\q,{i,j)  £  A.  (11) 

CLi 

We  take  pqj  =  0  for  any  j  £  Af\q  and  ( q,j )  £  A  such  that 
traffic  which  has  reached  the  fusion  center  is  not  sent  back 
into  the  network.  It  is  obvious  that 

E  Pij  =  V*  G  -E1?!  (12) 


where  w  is  a  scale  factor.  If  the  above  conditions  are  satisfied, 
add  k  into  the  set  B{i). 

5:  Broadcast  dU /dbi  to  the  upstream  neighbors.  If  \B(i)\  t  0,  also 
send  a  blocking  tag. 


Algorithm  2  Variable  update  at  each  cluster  head  i 

1:  for  all  j  £  {j  :  ( i,j )  £  A}  do 
2:  if  j  £  B(i)  then 

3:  pij  <—  0,  and  Ay  <—  0. 

4:  else 

5:  Compute 

(>  dU\  .  (>  dU\ 

Zij  =  Uij  +  —  -  mm  Uik  +  ttt  , 

\  dbj  J  {k:(i,k)eA,k^B(i)}  \  dbk  J 

and  let  kmin  he  a  value  of  k  which  achieves  the  minimiza¬ 
tion  in  the  above  equation. 

6:  Compute 

.  .  (  VJ  ■  z 

Ay  =  mm  pij , - 

V  CLi 

7:  Update  the  routing  variable  pik'. 

Pij  ~  A  ij,  it  kmin 

Pij  “F  )  '  A  ,  j  =  kmin 
j^^min 


which  is  another  way  to  describe  the  conservation  of  flow. 
Fig.  3  provides  a  simple  example  to  illustrate  these  parameters. 
Further,  we  say  there  exists  a  routing  path  for  a  cluster  head 
node  i  £  Af\q  to  the  fusion  center  q,  if  there  is  a  sequence  of 
nodes,  i,  j,  ■  ■  ■  ,  k,  q  such  that  pij  >  0,  •  •  •  ,  pkq  >  0. 

Let  p  be  a  column  vector  which  contains  all  the  elements 
(in  some  order)  of  the  set  {pij,  ( i,j )  £  A,  i  t  <?}■  Let  a  be 
the  node  flow  vector  containing  a,  for  each  i  £  Af\q,  b  be 
the  input  traffic  vector  with  elements  bi  for  each  i  £  A f\q, 
and  F  be  the  vector  of  Fij,  V(i,j)  £  A.  We  also  define  a  set 


n  =  {  p 


Pij  L  0,  El 


Pij  =  l,V(i,  j)  £  A,  Vi  £  A f\q 


Let  be  a  subset  of  F2  consisting  of  all  p  for  which  there 
exists  a  routing  path  for  every  cluster  head  node  i  £  A f\q  to 
the  fusion  center.  It  can  be  verified  that  for  each  p  £  Ft  and 
b  with  bi  >  0,  the  set  of  equations  (10)  and  (11)  has  unique 
solutions  for  a  and  F,  whose  components  can  be  written  as 
functions  of  b  and  p,  i.e.,  a;(p,  b)  and  Fjj (p,  b),  respectively. 
On  the  other  hand,  for  every  b  with  bij  >  0  and  every  F 
which  is  feasible  for  P.2,  there  exists  a  p  £  FI  which,  when 


8:  end  if 

9:  end  for 


plugged  into  the  function  Fjj(p ,  b),  will  produce  F.  Thus,  for 
each  input  set  b,  there  exists  a  unique  mapping  between  the 
two  sets  of  routing  variables,  F  and  p.  Such  unique  mapping 
enables  us  to  reformulate  P.2  in  terms  of  the  new  set  of  routing 
variables  p,  and  the  resulting  new  formulation  is  given  by 

P.3  :  min  U (p,  b)  =  E  UU  \FU  (P>  b)] 

(*.j) 

s.t.  p  £  FI. 

B.  A  Simple  Distributed  Routing  Algorithm 

There  are  several  algorithms  that  can  be  used  to  solve  P.3, 
which  allow  each  cluster  head  i  to  determine  its  own  routing 
variables  pij.  Among  them  are  [18]-[20].  As  the  algorithm 
given  in  [18]  requires  only  the  first-order  derivative  of  the 
objective  function  and  demands  less  computational  effort,  we 
choose  to  adapt  such  algorithm  to  solve  P.3  in  a  distributed 
fashion.  The  final  routing  algorithm  is  composed  of  two  parts: 
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^ target  :  °  2  : 
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Fig.  4:  A  snapshot  for  a  particular  cluster,  where  each  sen¬ 
sor  that  participates  in  sensing  is  annotated  with  a  3-tuple 

(A,  |A|,  A)- 


1)  a  protocol  for  information  exchange  between  neighboring 
cluster  heads,  as  summarized  in  Algorithm  1;  2)  a  method  for 
updating  the  routing  variables  at  each  cluster  head,  as  shown 
by  Algorithm  2.  In  both  Algorithm  1  and  2,  we  use  Ul:)  for 
dUij / d(Fij)  for  short6.  In  Algorithm  1,  we  also  distinguish 
between  a  cluster  head's  neighbors.  We  say  cluster  head  j  is 
downstream  from  cluster  head  i  if  there  is  a  directed  path  from 
i  to  j,  and  for  every  link  (fc,  to)  on  the  path,  we  have  pkm  >  0. 
We  say  cluster  head  i  is  upstream  from  cluster  head  j  if  j  is 
downstream  from  i.  Further,  if  there  is  no  pair  of  cluster  heads 

i, j  such  that  i  is  both  upstream  and  downstream  from  node 

j,  we  say  the  routing  is  loop  free. 

This  distributed  routing  algorithm  can  be  implemented 
independently  at  each  cluster  head  i.  It  successively  updates 
the  routing  variables  pij  based  on  information  communicated 
in  an  orderly  fashion  from  neighboring  cluster  heads.  The  opti¬ 
mality  and  the  convergence  of  this  algorithm  are  both  ensured, 
as  proved  originally  in  [18].  Further,  it  has  a  property  of  being 
loop  free  at  every  iteration,  which  can  prevent  the  potential 
deadlock  in  Algorithm  1  during  the  information  exchange 
between  neighboring  cluster  heads.  When  initializing  this 
algorithm,  it  is  also  desirable  to  start  with  a  loop  free  p  £  O. , 
for  example,  the  min-hop  path  or  the  greedy  path.  After 
initialization,  each  cluster  head  will  run  the  algorithm  locally 
for  multiple  iterations  until  the  total  energy  cost  converges. 
Since  this  routing  algorithm  is  implemented  in  a  distributed 
fashion,  it  can  react  rapidly  to  a  local  disturbance  and  result 
in  a  slower  line  tuning  in  the  rest  of  the  network.  It  is  thus 
also  applicable  when  the  input  statistics  are  slowly  changing 
with  time. 


6Note  that  each  cluster  head  can  estimate,  as  a  time  average,  the  traffic  F,;! 
for  each  outgoing  link,  and  the  estimated  Fij  can  be  used  to  calculate  t/. . . 


TABLE  I:  Transmission  Sequence 


Type 

Description 

Sequence 

y  Pi 

1 

ordered  ipi 

9,5,4,10,8,3,6,1 

8 

2 

ordered  A,;| 

1,2,3 

3 

3 

ordered  Di 

1,3, 8, 2 

4 

4 

Hi  (A,  IMA) 

1,5, 9, 3 

4 

5 

H2(A,  Aj ,  A) 

1,  3,  5,  8 

4 

VI.  Simulation  Results 

In  this  section,  we  present  some  simulation  results  to 
illustrate  this  distributed  framework.  We  firstly  demonstrate 
the  advantages  of  the  proposed  distributed  scheduling  ap¬ 
proach.  Consider  a  cluster  as  shown  in  Fig.  4,  where  15 
sensors  are  randomly  distributed  in  a  200m  x  200m.  region, 
and  the  cluster  head  is  located  at  (100,  100).  We  assume 
N  =  10  sensors  are  chosen  to  probe  a  range  resolution 
cell  that  contains  a  target.  To  reflect  the  difference  in  each 
sensor’s  battery  level,  we  designate  a  uniformly  distributed 
random  number  in  the  interval  of  [2000,25000]  (in  pi)  as 
each  sensor’s  current  energy.  When  computing  A  and  A,  for 
each  sensor,  we  consider  both  the  sensor-target  distance  and 
the  scattering  effect.  For  this  target-present  case,  we  assume 
Ai  >  0  for  simplicity.  To  estimate  the  consumed  energy 
for  data  transmission  from  each  sensor  to  the  cluster  head, 
we  adopt  a  simple  path  loss  model.  We  assign  a  3-tuple 
( A ,  |Aj|,  A)  to  each  of  the  N  sensors,  as  shown  in  Fig.  4. 
This  scenario  represents  a  snapshot  of  this  particular  cluster. 

In  this  example,  we  consider  five  distributed  scheduling 
approaches,  which  are  the  following: 

1)  Ordered  ipp.  a  pure  energy  based  approach  which  sched¬ 
ules  sensors  with  larger  ipi  for  earlier  transmissions; 

2)  Ordered  |Aj|:  this  is  the  approach  proposed  in  [10], 
which  favors  sensors  with  more  extreme  observations; 

3)  Ordered  Dp.  this  approach  gives  higher  priority  to 
sensors  with  larger  contributions  to  the  detection  per¬ 
formance; 

4)  Hi(A,  |Aj|,  ipi):  our  proposed  approach  that  jointly 
considers  A>  |A,|,  and  ipi.  Hi  is  defined  as: 

Hi  (A,  |Aj|,  ipi)  =  2  A  +  |  Ai|  +  A/1500; 

5)  LAA,  | Ai| ,  ipi):  H2  is  a  nonlinear  function,  and  is 
defined  as 

H2(A,  1  Ail,  A)  =  2 A  +  | A., I  -  105/A- 

Note  that  the  nonlinear  part  105/A  in  H2  can  be  regarded 
as  a  measure  of  the  reluctance  of  sensor  i  in  transmitting. 
For  Approach  1-3,  we  assume  the  same  stopping  rule,  i.e., 
A.  piAi  >  7],  where  77  is  the  test  threshold  and  is  set  as  40  for 
this  example.  For  Approach  4-5,  the  stopping  rule  is  y  p,A,  > 
r)  and  y  Pi  A  >  A,  where  A  is  chosen  as  10. 

In  Table  I,  we  list  the  resulting  transmission  sequences 
under  these  approaches.  In  Fig.  5,  we  further  compare  the 
performance  of  these  approaches,  in  terms  of  the  overall 
Kullback-Leibler  distance  (y  p,  A),  the  accumulated  log- 
likelihood  ratios  (y  ptA,),  the  minimum  residual  energy 
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Fig.  5:  Comparison  of  the  scheduling  approaches  in  terms  of 

J^PiDi,  min  V’i,  and  Y  Piei- 

(min  ipi),  and  the  total  energy  consumption  (Y  Pi^i).  It  is 
observed  that  Approach  1  achieves  the  largest  value  of  min^j, 
as  it  protects  sensors  with  smaller  ipi  by  not  scheduling 
them  to  transmit.  However,  without  considering  the  detection 
performance  at  all,  Approach  1  results  in  a  dramatic  increase 
in  both  the  amount  of  consumed  energy  and  the  number  of 
transmissions.  Approach  2  has  the  least  number  of  trans¬ 
missions.  But  without  considering  D,,  it  does  not  achieve 
the  desired  detection  performance  in  this  specific  example 
(i.e.,  Y  PiDi  <  10).  It  may  also  result  in  draining  out  a 
certain  sensor  earlier  (i.e.,  small  value  of  min^j).  Approach 
3  satisfies  the  detection  performance  requirement  in  terms  of 
YPiDi  >  10.  But  without  considering  energy,  this  approach 
has  mediocre  performance  in  both  min^j  and  Y  Piei-  Ap- 
proach  4  and  5,  although  dictated  by  different  H  functions, 
both  achieve  better  performance-energy  balances  than  the 
other  schemes.  Their  weighted  combinations  allow  a  decision 
to  be  made  within  a  moderate  number  of  transmissions  and 
with  the  desired  level  of  confidence.  Furthermore,  they  also 
successfully  avoid  burdening  nodes  with  small  residual  energy, 
and  achieve  the  minimal  amount  of  total  energy  consumption. 

Next,  we  illustrate  the  performance  of  the  distributed 
routing  algorithm  through  simulating  a  simple  example.  We 
assume  9  cluster  heads  and  a  fusion  center  (labeled  as  q)  are 
distributed  over  a  field  of  1600m  x  1600m,  and  we  set  the 
transmit  range  of  each  cluster  head  to  500m.  The  topology 
is  shown  in  Fig.  6.  To  estimate  e^,  we  compute  the  distance 
between  each  pair  of  linked  cluster  heads,  and  apply  a  simple 
path  loss  model  where  the  path  loss  exponent  is  set  to  2.  We 
assume  that  for  each  link  in  Fig.  6,  cl:j  is  known  and  fixed, 
and  we  adopt  Eq.  (9)  for  the  cost  function  Uij(Fij)  where  a 
is  set  to  0.1.  The  unit  for  the  total  cost  YUij(Fij)  can  be 
deemed  to  be  nJ/s,  although  Uij(Fij)  in  (9)  also  includes  a 
penalty  part. 

We  choose  vo  =  2  for  the  scale  parameter  in  Algorithm 
1  and  2,  and  simply  adopt  the  greedy  path  to  initialize  the 
distributed  routing  algorithm.  We  generate  different  sets  of 
input  traffic  b,  given  that  b,  (bits)  is  a  uniform  random  number 


Fig.  6:  Network  topology  used  for  the  simulation  example. 


Fig.  7:  Convergence  of  the  total  cost  Y  Uij(Fij)  given  differ¬ 
ent  sets  of  input  traffic  b. 

over  the  interval  [100,200].  For  each  set  of  input  traffic,  we 
run  the  distributed  routing  algorithm  at  each  cluster  head  for 
multiple  iterations.  For  each  iteration,  we  calculate  and  record 
the  resulting  total  cost  YUij{Fij).  We  plot  the  value  of 
Y  Uij  ( Fij )  versus  the  iteration  number  for  4  different  sets 
of  b  in  Fig.  7.  It  is  clearly  seen  that  the  value  of  the  energy 
cost  function  converges  smoothly  to  the  optimal  value,  and 
the  convergence  takes  roughly  25  iterations. 

VII.  Conclusion 

In  this  paper,  we  proposed  a  distributed  and  energy- 
efficient  framework  for  the  problem  of  Neyman-Pearson  detec¬ 
tion  of  fluctuating  radar  signals  in  large-scale  sensor  networks. 
Our  framework  is  scalable  with  respect  to  the  network  size, 
and  can  greatly  alleviate  the  requirement  on  the  storage 
and  processing  capability  of  the  fusion  center.  It  adopts  a 
clustering  architecture,  and  includes  a  distributed  scheduling 
protocol  and  a  distributed  routing  protocol  to  handle  infor¬ 
mation  transmissions  at  intra-cluster  and  inter-cluster  levels, 
respectively.  The  distributed  scheduling  protocol  allows  each 
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sensor  inside  a  cluster  to  make  an  autonomous  decision  about 
when  to  send  data  to  the  cluster  head,  according  to  its  own 
information  and  without  consulting  any  other  sensors.  The 
distributed  routing  protocol  enables  each  cluster  head  to  make 
its  own  routing  decisions,  with  its  available  information  and 
information  obtained  from  consultation  with  immediate  neigh¬ 
bors.  Simulation  examples  demonstrate  the  superior  tradeoff 
attained  by  the  distributed  scheduling  approach  between  the 
detection  performance  and  the  energy  efficiency,  and  the  fast 
convergence  of  the  distributed  routing  algorithm  to  the  optimal 
value  of  the  total  energy  cost. 
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